A divide-and-conquer approach to neural natural language generation from structured data
نویسندگان
چکیده
Current approaches that generate text from linked data for complex real-world domains can face problems including rich and sparse vocabularies as well learning examples of long varied sequences. In this article, we propose a novel divide-and-conquer approach automatically induces hierarchy “generation spaces” dataset semantic concepts texts. Generation spaces are based on notion similarity partial knowledge graphs represent the domain feed into sequence-to-sequence or memory-to-sequence learners concept-to-text generation. An advantage our is models exposed to most relevant during training which avoid bias towards majority samples. We evaluate two common benchmark datasets compare hierarchical against flat setup. also conduct comparison between models. Experiments show overcomes issues sparsity learns robust lexico-syntactic patterns, consistently outperforming baselines previous work by up 30%. find while outperform in some cases, latter generally more stable their performance safer overall choice.
منابع مشابه
A Divide and Conquer Approach to Sequencing
We suggest an approach to sequencing based on a \divide and conquer method". This approach eliminates the need for solving a hard NP-complete problem which arises when using traditional sequencing techniques. At the present time this algorithm has not been tried out in the lab. However, computer simulations using parts of the human genome provide some encouraging data.
متن کاملA Divide and Conquer Approach to Learning from Prior Knowledge
This paper introduces a new machine learning task|model calibration|and presents a method for solving a particularly diicult model calibration task that arose as part of a global climate change research project. The model calibration task is the problem of training the free parameters of a scientiic model in order to optimize the accuracy of the model for making future predictions. It is a form...
متن کاملA Divide-and-Conquer Approach for Language Identification Based on Recurrent Neural Networks
This paper describes the design of an acoustic language recognition system based on BLSTM that can discriminate closely related languages and dialects of the same language. We introduce a Divide-and-Conquer (D&C) method to quickly and successfully train an RNN-based multi-language classifier. Experiments compare this approach to the straightforward training of the same RNN, as well as to two wi...
متن کاملClustering for Data Reduction: A Divide and Conquer Approach
We consider the problem of reducing a potentially very large dataset to a subset of representative prototypes. Rather than searching over the entire space of prototypes, we first roughly divide the data into balanced clusters using bisecting k-means and spectral cuts, and then find the prototypes for each cluster by affinity propagation. We apply our algorithm to text data, where we perform an ...
متن کاملA divide-and-conquer approach for analysing overlaid data structures
We present a static program analysis for overlaid data structures such that a node in the structure includes links for multiple data structures and these links are intended to be used at the same time. These overlaid data structures are frequently used in systems code, in order to impose multiple types of indexing structures over the same set of nodes. Our analysis implements two main ideas. Th...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Neurocomputing
سال: 2021
ISSN: ['0925-2312', '1872-8286']
DOI: https://doi.org/10.1016/j.neucom.2020.12.083